AITopics | final revision

incorporate feedback into our final revision. 4 [R1]: " I don't exactly see if small batch vs large batch captures this phenomenon; if yes should say explicitly. "

Neural Information Processing SystemsFeb-13-2026, 21:02:44 GMT

We thank the reviewers for the detailed and insightful reviews. As the reviews noted, our work 1) introduces "novel Smith et al. [2017] make an explicit connection between small vs. large batch "A small discussion on if the phenomenon has been observed for different datasets/tasks with different optimizers" The phenomenon may not be true for other optimizers such as Adam, though. "concept of "memorizable and generalizable", though intuitive, is sketchy and not formally explained ... authors We acknowledge that the terms "memorizable" and "generalizable" are potentially confusing. We will revise our terminology to clarify this distinction. By "inherently noisy", we refer to the fact that high noise in the datapoints will necessitate larger sample complexity.

artificial intelligence, machine learning, noise, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

reviewers ' questions below and will incorporate feedback into the final revision

Neural Information Processing SystemsOct-3-2025, 03:54:26 GMT

We thank the reviewers for the detailed and insightful reviews. As the reviewers noted, our work 1) contributes to "a Thank you for the valuable feedback on this section -- we will incorporate this in our next revision. The intuition for the proof of Theorem 3.3 is that the optimization problem is convex over the space of probability By weak regularization, we refer to the fact that λ 0 for our Theorem 4.1 to hold. The difficulty with ReLU networks is that if the gradient flow pushes neurons towards 0, issues of differentiability arise. One potential approach to circumvent this issue is arguing that with correct initialization, the iterates will never reach 0. This is an interesting direction for future work and we thank the reviewer for this suggestion.

artificial intelligence, machine learning, reviewer, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.32)

Add feedback

63c17d596f401acb520efe4a2a7a01ee-AuthorFeedback.pdf

Neural Information Processing SystemsOct-3-2025, 02:08:51 GMT

ampprior, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.97)

Add feedback

FUSS YFCC100M Supervised p

Neural Information Processing SystemsOct-2-2025, 12:41:00 GMT

However, SNR evaluation without signal-level ground truth would still be a problem with CHiME-5.

evaluation, experiment, final revision, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

12e59a33dea1bf0630f46edfe13d6ea2-AuthorFeedback.pdf

Neural Information Processing SystemsOct-2-2025, 03:52:29 GMT

artificial intelligence, machine learning, walksat, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.51)

Add feedback

strongly-convex-concave minimax problems first, which we will add in the final revision

Neural Information Processing SystemsOct-2-2025, 00:07:43 GMT

We thank all the reviewers for their constructive comments. The intuition behind Algorithm 1 stems from a "conceptual" version of DIAG (also specified in Algorithm 1, Step 4), which is inspired from the conceptual version of Mirror-Prox (MP) (cf. We agree with and will include, the reviewer's comment, that the non-smoothness of We will devote more space to explaining the DIAG algorithm and discussing more related works. We will add a precise justification (which was omitted due to the lack of space) in the next revision. We discuss important ones below.

artificial intelligence, final revision, machine learning, (13 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.41)
Information Technology > Artificial Intelligence > Machine Learning (0.31)

Add feedback

incorporate feedback into our final revision. 4 [R1]: " I don't exactly see if small batch vs large batch captures this phenomenon; if yes should say explicitly. "

Neural Information Processing SystemsAug-20-2025, 00:46:19 GMT

We thank the reviewers for the detailed and insightful reviews. As the reviews noted, our work 1) introduces "novel Smith et al. [2017] make an explicit connection between small vs. large batch "A small discussion on if the phenomenon has been observed for different datasets/tasks with different optimizers" The phenomenon may not be true for other optimizers such as Adam, though. "concept of "memorizable and generalizable", though intuitive, is sketchy and not formally explained ... authors We acknowledge that the terms "memorizable" and "generalizable" are potentially confusing. We will revise our terminology to clarify this distinction. By "inherently noisy", we refer to the fact that high noise in the datapoints will necessitate larger sample complexity.

incorporate feedback, noise, small learning rate, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Filters

Collaborating Authors

final revision

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

incorporate feedback into our final revision. 4 [R1]: " I don't exactly see if small batch vs large batch captures this phenomenon; if yes should say explicitly. "

12e59a33dea1bf0630f46edfe13d6ea2-AuthorFeedback.pdf

d464b5ac99e74462f321c06ccacc4bff-AuthorFeedback.pdf

63c17d596f401acb520efe4a2a7a01ee-AuthorFeedback.pdf

reviewers ' questions below and will incorporate feedback into the final revision

63c17d596f401acb520efe4a2a7a01ee-AuthorFeedback.pdf

FUSS YFCC100M Supervised p

12e59a33dea1bf0630f46edfe13d6ea2-AuthorFeedback.pdf

strongly-convex-concave minimax problems first, which we will add in the final revision

incorporate feedback into our final revision. 4 [R1]: " I don't exactly see if small batch vs large batch captures this phenomenon; if yes should say explicitly. "